Improving Multivariate Microaggregation through Hamiltonian Paths and Optimal Univariate Microaggregation
نویسندگان
چکیده
The collection of personal data is exponentially growing and, as a result, individual privacy endangered accordingly. With the aim to lessen risks whilst maintaining high degrees utility, variety techniques have been proposed, being microaggregation very popular one. Microaggregation family perturbation methods, in which its principle aggregate records (i.e., microdata) groups so preserve through k-anonymity. multivariate problem known be NP-Hard; however, univariate version could optimally solved polynomial time using Hansen-Mukherjee (HM) algorithm. In this article, we propose heuristic solution inspired by Traveling Salesman Problem (TSP) and optimal solution. Given dataset, first, apply TSP-tour construction generate Hamiltonian path all dataset records. Next, use order provided given permutation records) input algorithm, virtually transforming it into solver call Multivariate (MHM). Our intuition that good solutions TSP would yield paths allowing algorithm find problem. We tested our method with well-known benchmark datasets. Moreover, show usefulness approach protecting location privacy, real-life trajectories datasets, too. compared results those best performing solutions, proposal reduces information loss resulting from microaggregation. Overall, suggest counterpart ordering microdata proper applying an leads reduction error keeping same guarantees.
منابع مشابه
A Polynomial Algorithm for Optimal Univariate Microaggregation
Microaggregation is a technique used by statistical agencies to limit disclosure of sensitive microdata. Noting that no polynomial algorithms are known to microaggregate optimally, Domingo-Ferrer and Mateo-Sanz have presented heuristic microaggregation methods. This paper is the first to present an efficient polynomial algorithm for optimal univariate microaggregation. Optimal partitions are sh...
متن کاملImproving the Utility of Differential Privacy via Univariate Microaggregation
Differential privacy is a privacy model for anonymization that offers more robust privacy guarantees than previous models, such as k-anonymity and its extensions. However, it is often disregarded that the utility of differentially private outputs is quite limited, either because of the amount of noise that needs to be added to obtain them or because utility is only preserved for a restricted ty...
متن کاملImproved Univariate Microaggregation for Integer Values
Privacy issues during data publishing is an increasing concern of involved entities. The problem is addressed in the field of statistical disclosure control with the aim of producing protected datasets that are also useful for interested end users such as government agencies and research communities. The problem of producing useful protected datasets is addressed in multiple computational priva...
متن کاملOptimal Multivariate 2-Microaggregation for Microdata Protection: A 2-Approximation
Microaggregation is a special clustering problem where the goal is to cluster a set of points into groups of at least k points in such a way that groups are as homogeneous as possible. Microaggregation arises in connection with anonymization of statistical databases for privacy protection (k-anonymity), where points are assimilated to database records. A usual group homogeneity criterion is wit...
متن کاملA polynomial-time approximation to optimal multivariate microaggregation
Microaggregation is a family of methods for statistical disclosure control (SDC) of microdata (records on individuals and/or companies), that is, for masking microdata so that they can be released without disclosing private information on the underlying individuals. Microaggregation techniques are currently being used by many statistical agencies. The principle of microaggregation is to group o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Symmetry
سال: 2021
ISSN: ['0865-4824', '2226-1877']
DOI: https://doi.org/10.3390/sym13060916